Visual Foresight Trees for Object Retrieval From Clutter With Nonprehensile Rearrangement

نویسندگان

چکیده

This paper considers the problem of retrieving an object from many tightly packed objects using a combination robotic pushing and grasping actions. Object retrieval in dense clutter is important skill for robots to operate households everyday environments effectively. The proposed solution, Visual Foresight Trees (VFT), intelligently rearranges surrounding target so that it can be grasped easily. Rearrangement with nested nonprehensile actions challenging as requires predicting complex interactions combinatorially large configuration space multiple objects. We first show deep neural network trained accurately predict poses when robot pushes one them. predictive provides visual foresight used tree search state transition function scene images. returns sequence consecutive push yielding best arrangement object. Experiments simulation real approach outperforms model-free techniques well model-based myopic methods both terms success rates number executed actions, on several tasks. A video introducing VFT, experiments, accessible at https://youtu.be/7cL-hmgvyec. full source code available https://github.com/arc-l/vft.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Rearrangement Planning using Nonprehensile Interaction

As we work to move robots out of factories and into human environments, we must empower robots to interact freely in unstructured, cluttered spaces. Humans do this easily, using diverse, whole-arm, nonprehensile actions such as pushing or pulling in everyday tasks. These interaction strategies make difficult tasks easier and impossible tasks possible. In this thesis, we aim to enable robots wit...

متن کامل

Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning

Rearranging objects on a tabletop surface by means of nonprehensile manipulation is a task which requires skillful interaction with the physical world. Usually, this is achieved by precisely modeling physical properties of the objects, robot, and the environment for explicit planning. In contrast, as explicitly modeling the physical environment is not always feasible and involves various uncert...

متن کامل

Visual Multiple-Object Tracking for Unknown Clutter Rate

In most multi-object tracking algorithms, tuning of model parameters is of critical importance for reliable performance. In particular, we are interested in designing a robust tracking algorithm that is able to handle unknown false measurement rate. The proposed algorithm is based on coupling of two random finite set filters that share tracking parameters. Performance evaluation with visual sur...

متن کامل

Spatial Keypoint Representation for Visual Object Retrieval

This paper presents a concept of an object pre-classification method based on image keypoints generated by the SURF algorithm. For this purpose, the method uses keypoints histograms for image serialization and next histograms tree representation to speed-up the comparison process. Presented method generates histograms for each image based on localization of generated keypoints. Each histogram c...

متن کامل

Modeling visual clutter perception using proto-object segmentation.

We introduce the proto-object model of visual clutter perception. This unsupervised model segments an image into superpixels, then merges neighboring superpixels that share a common color cluster to obtain proto-objects-defined here as spatially extended regions of coherent features. Clutter is estimated by simply counting the number of proto-objects. We tested this model using 90 images of rea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE robotics and automation letters

سال: 2022

ISSN: ['2377-3766']

DOI: https://doi.org/10.1109/lra.2021.3123373